In-database connected component analysis

نویسندگان

  • Harald Bögeholz
  • Michael Brand
  • Radu-Alexandru Todor
چکیده

We describe a Big Data-practical, SQL-implementable algorithm for efficiently determining connected components for graph data stored in a Massively Parallel Processing (MPP) relational database. The algorithm described is a linear-space, randomised algorithm, always terminating with the correct answer but subject to a stochastic running time, such that for any ǫ > 0 and any input graph G = 〈V,E〉 the algorithm terminates after O(log |V |) SQL queries with probability of at least 1− ǫ, which we show empirically to translate to a quasi-linear runtime in practice. Monash University, Melbourne, Australia. Email: [email protected] Monash University and Telstra Corporation, Melbourne, Australia. Email: [email protected] UBS, Zürich, Switzerland. Email: [email protected]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Connected Component Based Word Spotting on Persian Handwritten image documents

Word spotting is to make searchable unindexed image documents by locating word/words in a doc-ument image, given a query word. This problem is challenging, mainly due to the large numberof word classes with very small inter-class and substantial intra-class distances. In this paper, asegmentation-based word spotting method is presented for multi-writer Persian handwritten doc-...

متن کامل

Review and comparison of User Interface Characteristics of (Springer, Elsevier, Ebsco, ISI(WOS) and Ovid) as Perceived by University of Tehran Users

Background and Aim: The present investigation intends to compare and review various user interfaces from user standpoint and to ascertain its linkage with user satisfaction. Method: The research incorporated a descriptive survey of University of Tehran graduate student body. Using a targeted sampling, graduate students from the faculties of chemistry and Biology were selected. The instruments u...

متن کامل

The Use of a Selective Database Technique in Order to Recover the Spectra of a Series of Acrylic Paints by the Principle Component Analysis

A procedure for an efficient recovering of reflectance spectra of Acrylic paint samples from CIE tristimulus color values is described. By fixing a certain criteria based on color difference value, the proposed technique preliminarily selects a series of suitable samples from a main dataset containing the reflectance values of a series of different Acrylic paint samples, based on the color ...

متن کامل

Towards Text Recognition in Natural Scene Images

In this paper, we propose a novel methodology for text detection in natural scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully processes natural scene images having shadows, non-uniform illumination, low contrast and large signaldependent noise. Conn...

متن کامل

Text Detection in Indoor/Outdoor Scene Images

In this paper, we propose a novel methodology for text detection in indoor/outdoor scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully process indoor/ outdoor scene images having shadows, non-uniform illumination, low contrast and large signal-depende...

متن کامل

Modeling and Availability Analysis of Internet Data Center with various Maintenance Policies

In this paper, the authors have focused on the stochastic analysis of an internet data center (IDC), which consists of a database main server connected to a redundant server. Observing the different possibilities of functioning of the system, analysis has been done to evaluate the various reliability characteristics of the system. The system can completely fail due to failure of redundant serve...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1802.09478  شماره 

صفحات  -

تاریخ انتشار 2018